Dataset Vertical Partitioning for Rough Set Based Classification

Author

  • QASEM A. AL-RADAIDEH
Abstract

The dataset partitioning problem involves vertically partitioning a classification dataset into suitable subsets that preserve or enhance the classification quality of the original dataset. A typical classification model needs to be constructed for each subset, and all generated models are then combined to form the classification model. This paper presents a dataset partitioning approach for rough set based classification. In this approach, the dataset is partitioned into two mutually exclusive subsets. A local reduct set is generated for each attribute subset; these reduct sets are then combined and used to generate the set of classification rules. Preliminary experimental results using the partitioning approach over some standard medical datasets showed that the approach preserves the classification accuracy.
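The pipeline outlined in the abstract (split the attributes into two disjoint subsets, find a local reduct for each, then combine the reducts) can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the split-in-half rule, the brute-force reduct search, the toy medical-style table, and all function and attribute names are assumptions made only for illustration.

```python
from itertools import combinations


def indiscernibility_blocks(rows, attrs):
    """Group row indices that share the same values on attrs."""
    blocks = {}
    for i, row in enumerate(rows):
        blocks.setdefault(tuple(row[a] for a in attrs), []).append(i)
    return list(blocks.values())


def positive_region(rows, attrs, decision):
    """Row indices whose block (w.r.t. attrs) has a single decision value."""
    pos = set()
    for block in indiscernibility_blocks(rows, attrs):
        if len({rows[i][decision] for i in block}) == 1:
            pos.update(block)
    return pos


def local_reduct(rows, attrs, decision):
    """Smallest subset of attrs with the same positive region as attrs.

    Brute-force search; an assumption suitable only for tiny examples.
    """
    target = positive_region(rows, attrs, decision)
    for size in range(1, len(attrs) + 1):
        for subset in combinations(attrs, size):
            if positive_region(rows, subset, decision) == target:
                return list(subset)
    return list(attrs)


def vertical_split(attrs):
    """Hypothetical split rule: first half / second half of the attribute list."""
    mid = (len(attrs) + 1) // 2
    return attrs[:mid], attrs[mid:]


# Toy decision table (invented data, not taken from the paper).
rows = [
    {"fever": "high", "cough": "yes", "age": "old", "smoker": "yes", "flu": "yes"},
    {"fever": "high", "cough": "no", "age": "old", "smoker": "no", "flu": "yes"},
    {"fever": "low", "cough": "yes", "age": "young", "smoker": "yes", "flu": "no"},
    {"fever": "low", "cough": "no", "age": "young", "smoker": "no", "flu": "no"},
]
attrs = ["fever", "cough", "age", "smoker"]

left, right = vertical_split(attrs)  # two mutually exclusive subsets
combined = local_reduct(rows, left, "flu") + local_reduct(rows, right, "flu")
print(combined)  # ['fever', 'age']
```

On this toy table the combined local reducts keep every row in the positive region, so rules generated from `combined` would classify the table as consistently as the full attribute set does. Real datasets would need a heuristic reduct search, since the brute-force loop grows exponentially with the number of attributes.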

Similar resources

A Framework for Optimal Attribute Evaluation and Selection in Hesitant Fuzzy Environment Based on Enhanced Ordered Weighted Entropy Approach for Medical Dataset

Background: In this paper, a generic hesitant fuzzy set (HFS) model for clustering various ECG beats according to weights of attributes is proposed. A comprehensive review of electrocardiogram signal classification and segmentation methodologies indicates that algorithms able to handle the nonstationarity and uncertainty of the signals effectively should be used for ECG analysis. Ex...

A Hybrid Approach to Continuous Valued Datasets Classifying based on Particle Swarm Optimization, Variable Precision Rough Set Theory and Modified Huang-index Function

This paper proposed a new hybrid method, designated as PSOVPRS-index method, for partitioning and classifying continuous valued datasets based on particle swarm optimization (PSO) algorithm, Variable Precision Rough Set (VPRS) theory and a modified form of the Huang-index function. In contrast to the Huang-based index method which simply assigns a constant number of clusters to each attribute a...

Diagnosis of the disease using an ant colony gene selection method based on information gain ratio using fuzzy rough sets

With the advancement of metagenome data mining science has become focused on microarrays. Microarrays are datasets with a large number of genes that are usually irrelevant to the output class; hence, the process of gene selection or feature selection is essential. So, it follows that you can remove redundant genes and increase the speed and accuracy of classification. After applying the gene se...

A hybrid filter-based feature selection method via hesitant fuzzy and rough sets concepts

High dimensional microarray datasets are difficult to classify since they have many features with a small number of instances and an imbalanced distribution of classes. This paper proposes a filter-based feature selection method to improve the classification performance of microarray datasets by selecting the significant features. Combining the concepts of rough sets, weighted rough set, fuzzy rough se...

Indicator Selection based on Rough Set Theory

A method for indicator selection is proposed in this paper. The method, which adopts the General Methodology and Design Research approach, consists of four steps: Problem Identification, Requirement Gathering, Indicator Extraction, and Evaluation. The Rough Set approach has also been applied in the Indicator Extraction phase. This phase consists of 5 steps: Data selection, Data Preprocessing, Discr...

Publication date: 2007